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' ^ ' Abstract 

We show that a new probabilistic technique, recently introduced by the first author, yields 
the sharpest bounds obtained to date on mixing times of Markov chains in terms of isoperimetric 
properties of the state space (also known as conductance bounds or Cheeger inequalities). We 
prove that the bounds for mixing time in total variation obtained by Lovasz and Kannan, can 
py/ be refined to apply to the maximum relative deviation \p n (x,y)/n(y) — 1| of the distribution 

p j . at time n from the stationary distribution n. We then extend our results to Markov chains 

on infinite state spaces and to continuous-time chains. Our approach yields a direct link be- 
tween isoperimetric inequalities and heat kernel bounds; previously, this link rested on analytic 
estimates known as Nash inequalities. 
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1 Introduction 

>. 

Q\ , It is well known that the absence of "bottlenecks" in the state space of a Markov chain implies 

rapid mixing. Precise formulations of this principle, related to Cheeger's inequality in differential 
(y-j ■ geometry, have been proved by algebraic and combinatorial techniques ^ El El El EE E]- They 

have been used to approximate permanents, to sample from the lattice points in a convex set, to 
estimate volumes, and to analyze a random walk on a percolation cluster in a box. 

In this paper, we show that a new probabilistic technique, introduced in [201, yields the sharpest 
bounds obtained to date on mixing times in terms of bottlenecks. 

Let {p(x, y)} be transition probabilities for an irreducible Markov chain on a countable state 
space V, with stationary distribution ir (i.e., Y^xeV^i^P^^V) = ^{v) f° r x ^0- For x,y £ V, 
let Q(x,y) = n(x)p(x,y), and for S,A C V, define Q(S, A) = J2 s es,a^A Q( s i a )- For S C V, the 
"boundary size" of S is measured by \dS\ = Q(S,S C ). Following [15] . we call $5 := the 
conductance of S. Write 7r* := mm x ^v k(x) and define #(r) for r £ [71"*, 1/2] by 

$(r) = inf{$ 5 : ir{S) < r} . (1) 

For r > 1/2, let $(r) = = <J>(l/2). Define the e-uniform mixing time by 

p n {x,y) - %(y) 
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t \ . / p [x, y - it {y 
rie) = mim n : — < e V x, y G V > . 

1 7r(y) J 
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Jerrum and Sinclair considered chains that are reversible (i.e., Q(x,y) = Q(y,x) for all 
x, y £ V) and also satisfy 

p(x,x) > 1/2 for all x G V . (2) 
They estimated the second eigenvalue of p(-, •) in terms of conductance, and derived the bound 

r(e) <2$- 2 flog— + log -V (3) 

Algorithmic applications of @ are described in [23] . Extensions of (J3J) to non-reversible chains 
were obtained by Mihail ^B] and Fill A striking new idea was introduced by Lovasz and 

Kannan [T^], who realized that in geometric examples, small sets often have larger conductance, 
and discovered a way to exploit this. Let \\p — v\\ = \ J2yev \l J, (v) ~ u {y)\ be the total variation 
distance, and denote by 

ry(e) := minjn : \\p n (x, •) - n\\ < e for all x € Vj (4) 

the e-mixing time in total variation. (This can be considerably smaller than the uniform mixing 
time r(e), see the lamplighter walk discussed at the end of this section, or [JEJ Remark 1.) For 
reversible chains that satisfy ©, Lovasz and Kannan proved that 

Ml/A) < 2000 / (5) 

This formula was the impetus for the present paper. Related formulae for infinite Markov chains 
were obtained earlier from Nash inequalities and are discussed below. (As noted in ^S], there was 
a small error in [2]; the statement above is obtained from §3 in the survey by Kannan |14j.) 

Note that in general, Ty(e) < Ty(l/4) log 2 (l/e). Therefore, ignoring constant factors, the bound 
in JSJ is tighter than the bound of ©, but at the cost of employing a weaker notion of mixing. 

Our main result sharpens © to a bound on the uniform mixing time. See Theorem for a 
version that relaxes the assumption (|2"|). We use the notation a A (3 := min{a, /?}. 

Theorem 1 Assume JJ$. Then the e-uniform mixing time satisfies 

[ 4 /t Adu 

r e <l+ • 6 

J47T, u<5> 2 {u) 

More precisely, if 



4 / £ 4du 
4( 7 r(a;)A7r( 2 /)) U$ 2 (u) 



n>l+ I -^t—- (7) 



then 

p n (x,y) - 7r(y) 



< e. (8) 



7r(y) 

(Recall that <£(r) is constant for r > 5.) This result has several advantages over 

• The uniformity in (jSJ). 

• It yields a better bound when the approximation parameter e is small. 
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• It applies to non-reversible chains. 

• It yields an improvement of the upper bound on the time to achieve (JHJ when Tr(x),n(y) are 
larger than 7r*. 

• The improved constant factors make the bound (jHJ) potentially applicable as a stopping time 
in simulations. Under a convexity condition, these factors can be improved further; see £|H1 
Remark 3. 

Other ways to measure bottlenecks can yield sharper bounds. One approach, based on "blocking 
conductance functions" and restricted to the mixing time in total variation ry, is presented in |141 
Theorem 3]. 

Another boundary gauge ip is defined in ^2] of the present paper. For the n-dimensional unit 
hypercube, this gauge (applied to the right class of sets, see gives a bound of the right order 
r(l/e) = 0(n log n) for the uniform mixing time. Previous methods of measuring bottlenecks did 
not yield the right order of magnitude for the uniform mixing time in this benchmark example. 

Theorem ^ is related to another line of research, namely the derivation of heat kernel estimates 
for Markov chains using Nash and Sobolev inequalities. For finite Markov chains, such estimates 
were obtained by Chung and Yau and by Diaconis and Saloff-Coste [S]. In particular, for the 
special case where $ is a power law, the conclusion of Theorem ^ can be obtained by combining 
Theorems 2.3.1 and 3.3.11 of Saloff Coste [21]. For infinite Markov chains, Nash inequalities have 
been developed for general isoperimetric profiles; see Varopoulos [21], the survey by Pittet and Saloff 
Coste |21| . the book [23]) an d especially the work of Coulhon 00- Even in this highly developed 
subject, our probabilistic technique yields improved estimates when the stationary measure is not 
uniform. Suppose that n is an infinite stationary measure on V for the transition kernel p. As 
before, we define 

Q(x, y) = 7r(x)p(x, y); \dS\ = Q(S, S c ); $ s := 

Define $(r) for r G [it*, oo) by 

$(r) = inf{$ 5 : n{S) < r] . (9) 

Theorem 2 (infinite stationary measure case) 

Suppose that < 7 < \ and p(x, x) > 7 for all x £ V . If 

(I-7) 2 f 4 A Adu 
n > 1 + ^ ^ / —27-- , (10) 

7 J4(n(x)/\n(y)) U9' L [u) 



then 



p n (x,y) 



< e. (11) 



This Theorem is proved in Section 6. For the rest of the introduction, we focus on the case of finite 
stationary measure. 

Definition: Evolving sets. Given V, vr and Q as above, consider the Markov chain {S n } on subsets 
of V with the following transition rule. If the current state S n is S C V, choose U uniformly from 
[0, 1] and let the next state S n+ i be 

S = {y:Q(S,y)>U7r(y)}. 



3 




1/8 < U < 2/8 



Figure 1: One step of the evolving set process. 



Consequently, 



P(y€S) = P(Q(S,y)>Uir(y) 



Q(S,y) 



(12) 



7r(y) 

Figure n illustrates one step of the evolving set process when the original Markov chain is a random 
walk in a box (with a holding probability of |). Since ir is the stationary distribution, and V are 
absorbing states for the evolving set process. 

Write Ps^ ■ j := ■ So = S^j and similarly for Es^ • j. The utility of evolving sets stems 
from the relation 

TT{X) 1 ; 

(see Proposition [7J). Their connection to mixing is indicated by the inequality 



llMn - tt|| < -TT E M V n ( S ") A n ( S n) > 

7T(Xj v 

where fj, n := p n (x, •); see (|24|l for a sharper form of this. The connection of evolving sets to 
conductance can be seen in Lemma 01 below. 

Example 1 (Random Walk in a Box): Consider a simple random walk in an n x n box. 
To guarantee condition (J2J we add a holding probability of | to each state (i.e., with probability | 
do nothing, else move as above). The conductance profile satisfies 

$(u) > 



for 1 < u < 1/2, where a is a constant. Thus our bound implies that the e uniform mixing time is 
at most 

"1/2 1 

C e + Al — ^du = 0(n 2 ), 

a \ 
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Figure 2: A box with holes. 




Figure 3: Random walk in a percolation cluster. 

which is the correct order of magnitude. Of course, other techniques such as coupling or spectral 
methods would give the correct-order bound of 0(n 2 ) in this case. However, these techniques are 
not robust under small perturbations of the problem, whereas the conductance method is. 

Example 2 (Box with Holes): For a random walk in a box with holes (see Figure it is 
considerably harder to apply coupling or spectral methods. However, it is clear that the conduc- 
tance profile for the random walk is unchanged (up a constant factor), and hence the mixing time 
is still 0(n 2 ). 

Example 3 (Random Walk in a Percolation Cluster): In fact, the conductance method 
is robust enough to handle an even more extreme variant: Suppose that each edge in the box 
is deleted with probability 1 — p, where p > ^. Then with high probability there is a connected 
component that contains a constant fraction of the original edges. Benjamini and Mossel [3] showed 
that for the random walk in the big component the conductance profile is sufficiently close (with 
high probability) to that of the box and deduced that the mixing time is still 0(n 2 ). (See Q7] 
for analogous results in higher dimensions.) By our result, this also applies to the uniform mixing 
times. 

Example 4 (Random Walk on a Lamplighter Group): The following natural chain 
mixes more rapidly in the sense of total variation than in the uniform sense. A state of this chain 
consists of n lamps arrayed in a circle, each lamp either on (1) or off (0), and a lamplighter located 
next to one of the lamps. In one "active" step of the chain, the lamplighter either switches the 
current lamp or moves at random to one of the two adjacent lamps. We consider the lazy chain 
that stays put with probability 1/2 and makes an active step with probability 1/2. The path of the 
lamplighter is a delayed simple random walk on a cycle, and this implies that Ty(l/4) = 0(n 2 ), 
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Figure 4: Random walk on a lamplighter group 

see However, by considering the possibility that the lamplighter stays in one half of the cycle 
for a long time, one easily verifies that r(l/4) > cin 3 for some constant c\ > 0. Using the general 
estimate r(e) = 0{jy (e) log (l/V*)) gives a matching upper bound r(l/4) = 0(n 3 ). 



2 Further results and proof of Theorem 1 

We will actually prove a stronger form of Theorem ^ using the boundary gauge 



if)(S) :=1-E S « 



n(S) 



instead of the conductance $5. The next lemma relates these quantities. 

Lemma 3 Let / S C V. If Q] Zio/cfe, i/ien ^(5) > $|/2. More generally, if < 7 < ^ and 
p(x,x) > 7 /or a// x 6 V, then i/>(S) > 2 (i_ 7 )^ ^5- 

See £|Ufor the proof. In fact, is often much larger than <&g. 

Define the roo£ profile ip(r) for r S [71"*, 1/2] by 



^(r) = inf{^(5) : n(S) < r}, 



(13) 



and for r > 1/2, let ?/>(r) := ip* = ip(^). Observe that the root profile ifi is (weakly) decreasing on 
[tt*,oo). 

For a measure ^ on V, write 



7r(y) 



M(y) 2 

7r(y) 



1. 



(14) 



By Cauchy-Schwarz, 



2\\fjL — vr I 



71" | = 


M(0 t 


< 








vr(-) 


L1(tt) - 


vr(-) 





(15) 



We can now state our key result relating evolving sets to mixing. 
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Theorem 4 Denote fi n = p n (x, ■ ). Then x 2 (/u n ,7r) < e for all 

> /- 4 A du 

See §5 for the proof. 

Derivation of Theorem ^ from Lemma El and Theorem H] 

The time-reversal of a Markov chain on V with stationary distribution ir and transition ma- 
trix p(x,y), is another Markov chain with stationary distribution tt, and transition matrix *p (•, •) 
that satisfies vr(y)p(y, z) = Tr(z)*p (z, y) for all y,z £ V. Summing over intermediate states gives 
ir(z)*p m (z, y) = ir(y)p m (y, z) for all z,y £ V and m > 1. 

Since p n+m (x,z) = ^y^y p n {x,y)p m {y, z), stationarity of tt gives 

p n+m (x, z) - tt(z) = £ (p n (x, y) - 7r(y)) (p m (y, *) - n(zj) (16) 



whence 



\ P n+m {x,z) -Tl(z) 



(17) 



I 7r(z) 

1^ W V vr(y) A 7r(y) J V ; 

< X (p n (x,-),ir)x(t m (z,-),ir) (19) 

by Cauchy-Schwarz. 

The quantity Q(S, S c ) represents, for any S C V, the asymptotic frequency of transitions from 
S to S° in the stationary Markov chain with transition matrix p(-, •) and hence Q(S, S c ) = Q(S C , S). 
It follows that the time-reversed chain has the same conductance profile <£(•) as the original Markov 
chain. Hence, Lemma El and Theorem 0] imply that if 

, 4 A 2du 

m,£> 

and © holds, then 



(n(x)An(y)) U® 2 (u) ' 

x(p e (x,-),n) <Ve and X (*P m (z, •),*■) 



Thus by (fT3|) . 

;/ +m (x, 2f) — 7r(z) 



7r(z) 



< e. 



and Theorem^ is established. 

In fact, the argument above yields the following more general statement. 

Theorem 5 Suppose that < 7 < \ an d p( x > x ) — 7 f or all x £ V. If 

(1_ 7 )2 ,4/e Mu 

n > 1 + ^ ^- / — ^ , 20 

i/ien holds. 

To complete the proof of Theorems ^ and it suffices to prove Lemma |3] and Theorem 0J This 
is done in and £JSJ respectively. 
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3 Properties of Evolving Sets 

Lemma 6 The sequence {vr(5 n )} Tl >o forms a martingale. 
Proof: By (|12|) . we have 

E (jr(S n+1 )\s n ) = *(V) P (v G S n+i S, 
y&V 

= Q( s n,y) = n(Sr, 

y&V 



The following proposition relates the nth order transition probabilities of the original chain to the 
evolving set process. 

Proposition 7 For all n > and x, y £ V we have 

p n ( X ,y) = ^lp {x} (y £ S n ). 
ir{x) 1 ' 

Proof: The proof is by induction on n. The case n = is trivial. Fix n > and suppose that the 
result holds for n — 1. Let U be the uniform random variable used to generate S n from S n -x. Then 



P n (x,y) = J2p n 1 (x,z)p(z,y) 



E p w(*^i)^j>(*,v) 

^Me {x} f_LQ(<? n _ 1)2/ ) 
7r(x) 1 ; V7r(y) 

7r(x) 



P{x}(y e s n ). 



We will also use the following duality property of evolving sets. 

Lemma 8 Suppose that {S n } n >o is an evolving set process. Then the sequence of complements 
{Sn}n>o is also an evolving set process, with the same transition probabilities. 

Proof: Fix n and let U be the uniform random variable used to generate <S n +i from S n . Note 
that Q(S n ,y) + Q(S°,y) = Q(V,y) = vr(y). Therefore, with probability 1, 

S c n+ i = {y.Q(S n ,y)<U7r(y)} 

= {y:Q(S c n ,y)>(l-U)7r(y)}. 

Thus, {S^} has the same transition probabilities as {S n }, since 1 — U is uniform. □ 
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Next, we write the \ 2 distance between fi n := p n (x, ■ ) and tt in terms of evolving sets. Let 
{S n } n >o and {A n } n >o be two independent replicas of the evolving set process, with So = Aq = {x}. 
Then by (fTl|) and Proposition [7[ x (/in,7r) equals 



(21) 

[ ]T <y)r { J{y e s n } n { y e A n }) - vr(x) 2 ] (22) 

7T[ - X) ydV 

— —= B {x} (7T(S n n A n ) - 7T(S n )7T(A n )) , (23) 



where the last equation uses the relation tt(x) = Er x \Tr(S n ) = Er x \7r(A Tl ). For any two sets 
S, A C V, 

ir(S n A) + ^(5 C n A) = tt(A) = 7r(S)7r(A) + it(S c )it(A), 

and hence 

\tt(S n A) - 7r(S)7r(A)| = |vr(,S c n A) - ^(5 c )vr(A)|. 
Similarly, this expression doesn't change if we replace A by A c . Thus, if we denote 

s t . = f s if 7r ( 5 ') ^ b 
I S c otherwise, 

then 

|7r(Sn A) - 7r(5)7r(A)| = |vr(5 s n A a ) - vr(5 a )vr(A a )| 

< |vr(5 s ) Avr(A»)| 

< ^7r(58)7r(A») . 

Inserting this into 1)23(1 . we obtain 



X'(^n,vr) < E V /vr(^)vr(A? l ) 

whence 

1 



2||/in-vr|| < X (^n,vr) < — -EVvr(5»). (24) 

7T(X) 

4 Evolving sets and conductance profile: proof of Lemma 3 

Lemma 9 For every real number (3 G [— 5 , 5]? we ^o^e 



< V 1 "/ 32 ^ 1-/3 /2. 
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Proof: Squaring gives the second inequality and converts the first inequality into 

1 + 2(3 + 1 - 2(3 + 2y / l-4/? 2 < 4(1 - (3 2 ) 
or equivalently, after halving both sides, 

'1-4/3 2 < 1 — 2/3 2 , 

which is verified by squaring again. □ 



Lemma 10 Let 

<ps 



y&v 

Then 



: =^(s) E(^(^) A ^ c >y))- ( 25 ) 



i - m < vi + '^s + vi-'^ < 1 _ ^ /2 (26) 

Proof: The second inequality in (|26[) follows immediately from Lemma To see the first inequal- 
ity, let U be the uniform random variable used to generate S from S. Then 

l ) n(y) 



Consequently, 



7r(y)P 5 (y G S | £/ < i) = Q(S, y) + (Q{S C , y) A Q(S, y) 
Summing over y G V, we infer that 

E 5 (tt(5) | £7 < i) = tt(5) + 2ir(S) Vs ■ (27) 

Therefore, R := ir(§)/ir(S) satisfies E S ( J R|C7 < \) = 1 + 2^. Since E 5 i? = 1, it follows that 

E s (i?|£/>±) = 1-2^ • 

Thus 

l-if>(S) = E(v A R) 

E(^R|[/ < i) + E(-/R|Z7 > i) 



y /E(fl|C/<^) + ^/E(i;|f7>^) 
2 

by Jensen's inequality (or by Cauchy-Schwarz). This completes the proof. 
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Proof of Lemma |3J If p(y, y) > 1/2 Vj/ S 7, then it is easy to check directly that ip$ = $5 for 
all ScV. 

If we are only given that p{y,y) > 7 Vy £ V, where < 7 < |, we can still conclude that for 

yes, 

Q(S, y) A Q(S C , y) > 7 vr(y) A Q(S C , y) > ->—Q(S c , y) . 

1-7 

Similarly, for y £ S c we have Q(S,y) A Q(S c ,y) > j^Q(S,y). Therefore 

£ [Q(S, y) A Q(S C , y)] > -^-Q(S, S c ) , 

whence ^5 > jzzz&s- This inequality, in conjunction with Lemma ITU1 yields Lemma |3 □ 



5 Proof of Theorem |4] 

Denote by K{S,A) = ~Ps(S = A) the transition kernel for the evolving set process. In this section 
we will use another Markov chain on sets with transition kernel 

K(S,A) = ^-K(S,A). (28) 

This is the Doob transform of K(-, •). As pointed out by J. Fill (Lecture at Amer. Inst. Math. 2004), 
the process defined by K can be identified with one of the "strong stationary duals" constructed 
in®. _ 

The martingale property of the evolving set process, Lemma |BJ implies that ^ A K{S, A) = 1 
for all S C V . The chain with kernel (|28|) represents the evolving set process conditioned to absorb 
in V; we will not use this fact explicitly. 

Note that induction from equation l(2*5|) gives 

K n (S,A) = ^-K n (S,A) 



for every n, since 



K n+1 (S,B) = ^r(5,A)f(A,B) 

A 

= e4§^(^was) 

n{B) K n+ \S,B) 



tt(S) 

for every n and B C V. Therefore, for any function /, 

~it(S n 



tt(S) f{Sn) 



(29) 
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where we write E for the expectation when {S n } has transition kernel K. Define 



tt(SI) 

and note that n(S n ) = Z~ 2 when Z n > y/2, that is, when ir(S n ) < \. Then by equations (|29|) and 
J21, X(^n,vr) < B {x} (Z n ) and 



E 



Z 



n+l 



Z n 



E 



E 





Z n +1 


vr(5 n ) 




Z n 




l) 







(30) 



< l-V(vr(5„)) = l-/ (Z n ), (31) 

where fo(z) := ip(l/z 2 ) is nondecreasing. (Recall that we defined tp(x) = ip* for all real numbers 
x > 2-) Let Lq = Zq = ii(x)~ l l 2 . Next, observe that E(-) is just the expectation operator with 
respect to a modified distribution, so we can apply Lemma ITT1 below, with E in place of E. By part 
(iii) of that lemma (with 5 = \fe) , for all 



n > 



L o 2dz 



s zf Q (z/2) J s z^(4/z 2 ) 



L o 2dz 



(32) 



we have x(A t n> 7r ) < ^{x}(^n) < 8- The change of variable u = 4/z 2 shows the integral (|32|) equals 



This establishes Theorem @J 



4 / 52 du f i/€ du 
< 



tt( x ) U1p(u) Ja-k{x) U1p(u)' 



Lemma 11 Let /, /o : [0, oo) — > [0,1] be increasing functions. Suppose that {Z„} n > are non- 
negative random variables with Zq = Lq. Denote L n = ~E(Z n ). 

(i) If L n - L n+1 > L n f(L n ) for all n, then for every n > //"> " ' 



IS zf(z)> 



we have L n < 5. 



(ii) If E(Z n+ i\Z n ) < Z n (l — f(Z n )) for all n and the function u \— > uf(u) is convex on (0, oo), 
then the conclusion of (i) holds. 

(iii) If E(Z n+ i\Z n ) < Z n (l — fo(Z n )) for all n and f{z) = fo(z/2)/2, then the conclusion of (i) 
holds. 

Proof: (i) It suffices to show that for every n we have 

" Lt » dz 



L n zf{z) 



> n. 



(33) 



Note that for all k > we have 



Lk+i < Lk 



1 - f(L k ) 



< L k e~ f{Lk) 
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whence 

f Lk dz > 1 r L *> dz 1 ^ L k > 

Summing this over A; G {0, 1, . . . ,n — 1} gives (|3*3*|) . 

(ii) This is immediate from Jensen's inequality and (i). 

(iii) Fix n > 0. We have 

E (Z n - Z n+1 ) > E [2Z n f(2Z n )] > L n f(L n ) , (34) 
by Lemma PHU below. This yields the hypothesis of (i). □ 

The following simple fact was used in the proof of Lemma 1111 

Lemma 12 Suppose that Z > is a nonnegative random variable and f is a nonnegative increasing 
function. Then 

E(Z/(2Z)) >^f-f(EZ). 
Proof: Let A be the event {Z > EZ/2}. Then E(Zl A o) < EZ/2, so E(Z1 A ) > EZ/2. Therefore, 

E(Z/(2Z)) > E(Z1 A • /(EZ)) > ^ff(EZ) . 

□ 



6 Infinite stationary measures: proof of Theorem [2] 

Proof: For a probability measure /i on V, define x 2 (/ x ) 7r ) by 



We now write x 2 (/%, tt) in terms of evolving sets. Let {5 n } n >o and {A n } n >o be two independent 
replicas of the evolving set process, with 5o = Ao = {x}. Then by (|35jl and Proposition Q 

ydV ^ X > 



7r(x) 



^[ £ <y)?({y g M nfee An})] (37) 



-^_E(vr(5 n n A n )) < (sf^S^K)) . (38) 



whence 



X(^n,vr) < — -EJn(S n ). (39) 

7T (X) v 
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As in the finite case, if K is the Doob transform of K with respect to 7r, then 

~n(S n ) 



Esf(S n ) = Es 



Define 



1 



'f(Sn) 



(40) 



Then by equations (|4TI|) and xil^n,^) < E{ ;I .}(Z n ) and 



E 



+1 



E 



E 



^/vr(5 n+1 ) 



^91 



5,, 



< l-V(vr(5„)) = l-/o(^n), 
where fo(z) = ^(l/z 2 ) is increasing. Let Lq = Zq = ^(x)" 1 / 2 . By Lemma ITTT iii) above, for all 



(41) 



n > 



L o 2dz 



^ zf (z/2) z^jzi) 



L o 2dz 



(42) 



we have xil^n, tt) < E{ a; j(Z n ) < \[e. The change of variable u = 4/z 2 shows the integral ()42|) equals 

4 A , (1 - 7 ) 2 /" 4 /« 2du 



Hn(x) UTp(u) 7 2 Att^) li^ 2 ^) 

Let *p denote the time-reversal of p(-, ■). Then for all 

(1_ 7 )2 ,4/ e 2du 



we have 



Thus 



T J4,(n(x)An(y)) U® Z (U) 

X (p n (x,-),ir) < and X (t m {z,-),ir) < y/i. 
p n+m (x, z) 



ir(z) 



1 



ir(z) 



Y,P n (x,y)p m (y,z) 



5>«(^)( 



y&V 



n{y) > V 7r(y) 



< 



x(p ri (x ! -) ! 7r) X ( < p m (^0,7r) <e, 



(43) 

(44) 
(45) 



where the first inequality is Cauchy-Schwarz. This establishes Theorem |^J 
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7 Continuous Time 



In this section we extend our results to continuous-time, finite chains. We consider the chain 
{Xt,t > 0} that moves at rate 1 according to P, where P(x,y) is a transition kernel on V with 
stationary distribution tt. Let $p be the conductance profile for P. 



Theorem 13 Let Xt be a continuous-time, finite chain with transtion kernel ht 
the e-uniform mixing time satisfies 

r 4 A 8du 



r(e)< 



More precisely, if 



then 



t > 



4/6 



47r „ U$ 2 {u) ' 

8du 



(n(x)ATT(y)) U<& 2 (U 

h t (x,y) - vr(y) 



Proof: As before, it is enough to show that x 2 (P(^t 

/•4A 4du 

t > 



< e. 



e t(P-/)_ Then 
(46) 

(47) 
(48) 



, 7r) < e for all 



Utv(x) U§ 2 (u) ' 

Consider the Markov operator P = ^(P + J) with corresponding transition probabilities p(-, •). 
Let <3? and tp be the conductance profile and root profile of P, respectively. Note P satisfies condition 
(J2J) so Theorems 0] and n apply- Let {Xt} be the chain with transition kernel ht = e 4 ( p_ ^. Observe 



that e 2 *( p - 7 ) 



AP-i) 



$ = 77$ and -0 > , it is enough to show that x^it^t, 7r ) < e f° r all 



, so Xit has the same law as X t . Let fit = ^t(a^ •) = ht/2( x i •). Since 



t > 



du 



4/e 

47r(z) Ul/j(u) 



We accomplish this using the natural continuous-time evolving set process corresponding to Xt- 
Let {S(t) : t > 0} be the process which at rate 1 moves according to the evolving set transition 
kernel for P. Let {S n : n > 0} be the (discrete time) evolving set process for P. Note that 



P x (X t = y) 



E 

3=0 L 

oo 

E 

3=0 L 
7r(x) 



p j (x,y) 



ir[x) 



P{x}(y G <Sj-) 



Pw(»6S(t))- 



(49) 
(50) 
(51) 



Our proof will parallel the proof of Theorem 2] One can argue as in Section |3] to obtain xil^t, ?r) < 
^E^{S[). Define 
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and let L t = E(Z t ), so that x(lH^) < Lt- Note that 



j=0 L 



L; 



is differentiable. Equation (|31j) implies that 



<l-/o(^ 



where D is the event that the evolving set process {£(•)} makes exactly one transition in (t, t + e]. 
It follows that for all t > we have 

E(Z t - Z t+e \S t ) > eZ t fo(Z t ) + 0(e 2 ). 

Fix t > 0. Taking expectations above, we get 



E (Z t - Z t+e ) > eE [2Z t f(2Z t j\ + 0(e 2 ) > eL t f(L t ) + 0(e 2 



where the last inequality holds by Lemma IT21 

It follows that Lt+e — Lt < —eLtf(Lt) + 0(e 2 ) and hence 

^ < -It/(It). 
The following Lemma is an analog of Lemma 1111 
Lemma 14 For every 

t > [ T ° dZ 
~ J 8 Zf(z) ' 

we have L% < S. 

Proof: It's enough to show that for all t > we have 

r L ° dz 

lit zf(z) ' 

This is an equality for t = 0, and differentiating both sides gives 

1 < 



t < 



Ltf(L 



1 n 



which holds by equation (|53|) . 
Lemma El i m phes that for all 



t > 



L o 2dz f L ° 2dz 



s zf (z/2) J s zi/>(4/z*) 



(52) 



(53) 



(54) 



(55) 



we have x(^t; 7r ) < L t < 5. Let e = <5 2 . We calculated after (|4*2*|) that the integral on the righ-hand 
side of ((53|) equals 

f 4 A du 



4tt(x) 



This establishes Theorem 1131 
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Concluding remarks 



1. The example of the lamplighter group in the introduction shows that Ty(l/4), the mixing 
time in total variation on the left-hand side of (JSJ), can be considerably smaller than the 
corresponding uniform mixing time r(l/4) (so an upper bound for r(-) is strictly stronger). 
We note that there are simpler examples of this phenomenon. For lazy random walk on a clique 
of n vertices, ry (1/4) = 0(1) while r(l/4) = O(logn). To see a simple example with bounded 
degree, consider a graph consisting of two expanders of cardinality n and 2 n , respectively, 
joined by a single edge. In this case ry(l/4) is of order @(n), while r(l/4) = 0(n 2 ). 

2. Let X n be a finite, reversible chain with transition matrix P. Write fj,® := p n (x, •). Equation 
(J2U) gives 

x(K>*) < -^rEV^OsI) < -±=(i - AT ■ (56) 



ir(x) 



Let fi : V — > R be the second eigenfunction of P and A2 the second eigenvalue, so that 
Ph = ^2/2- For x £ V, define f x :V^Hby f x (y) = S x (y) - n(y), where 5 is the Dirac 
delta function. We can write /a = J2xeV a xfx- Hence 



P n f(-) 



7T-) 



L\-k) 



i 2 W 



tt(-) 

= J! a ^X(^n^) 

a; 

< const • maxx(^,i 

< const • (1 -ip*) n , (60) 
where the first line is subadditivity of a norm and the last line follows from (|56|) . But 



(57) 

(58) 
(59) 



P n f{-) 



tt(-) 



i 2 W 



> 



pn/(. 



vr(-) 



^|P"/ 2 (x)|=A^|/ 2 (x)|. 



(61) 



Combining (|H0|) and (fSTf) gives Af < c • (1 — ip*) n for a constant c. Since this is true for all n, 
we must have A2 < 1 — ip* , so ip* is a lower bound for the spectral gap. 

3. Variants of conductance can give better bounds on tps- F° r S dV, define 

0s, 1 



tt(S) 



y&S 



Note that for reversible chains we have 



0s 



1 



tt(S) 



7T (y)yp(y^ Sc )^ 



yes 



which is strictly greater than $5 for S £ {0, V} (since &s can be written in a similar way, 
but without the square root.) 

Following Houdre and Tetali ^2]) denote 



hi 



mi{e s :7r(S) < i} 



(62) 
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Theorem 15 Suppose that p(x,x) > | for allx. Then 

1>{S) > 



9 2 



81og(2/0f) • 

Consequently, assuming reversibility, the spectral gap 1 — A2 satisfies 



1 - A 2 > 



Hog (2/(fc 



i? 



> c- 



(fr 2 + ) 2 

log(4)l 



(63) 



Up to the logarithmic factor in the denominator, this type of inequality was conjectured by 
Houdre and Tetali [TJl Remark 3.5]. 

Proof: For u G [0, 1], let A u = {y : Q(S,y) > un(y)}. Note that A u C S for u > | since 
p(ic, > i for all x. We have 



E, 



1 / 7t(A) 

V < S ) 

1 / 



du 



n(S) 

< 1 + 1 Z 1 ^)-^ du 
S +2 7o tt(5) 

x /* (tt(4.) - tt(S)) 2 



8 /l 

2 



7^ 



(in, 



by the inequality y/TTt <l + t/2- lt 2 l{t < 0), valid for t > -1. Define 

B t :=5-^a_ t = {y G S : Q(S,y) < (1 - t)*(y)} 
= {yeS:Q(S c ,y)>t7r(y)}. 

The middle term in Q64|) vanishes by the martingale property. Thus 

8 7o vr(5) 2 a ' 

where we have made the substitution t = 1 — u. Therefore, for any a G (0, ~] we have 

1 1 
8tt(5) 2 ^(5) • log(l/2a) > [ 2 ir(B t ) 2 dt ■ [ 2 - dt 



> 



5 TrfB* 



(64) 
(65) 
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by Cauchy-Schwarz. But 



2 ir(B t 



Vt 



dt 



l 

yes Ja vt 

Q(S c ,y)/n(y) 1 

—=db 



> Y,<v) [ 

yes 



V 



= 2 \J2^Q(S c ,yMy)\ -2V^-tt(5) 
Setting a = Og/4 and using equation , we get 



8^(S)log(2/0f)> 



2 TrQgt) 
e|/4 \/t 



dt 



> 



and the theorem follows. 



4. Theorems @] and [5] can be improved under a convexity condition that holds in many examples. 
In the setting of Theorem if ip(r) > tp c ( r ) for all r > 7r* where u t—* uifj c (u~ 2 ) is a convex 
function of u £ [0, do), then X 2 (/-i n , 7r) < e for all 



Itt(x) U1p c (ll) 

To prove this, follow the proof of Theorem 21 until Q3U|) . which implies that 



2r n 



where /(z) := tp c ( z 2 )- Then invoke Lemma fTTT ii') and apply the change of variable u = 1/z 



S n )<l- M<Sn)) = 1 " f(Z n ) , 

). iiicu mvurvc mamno iiii ui auu ayyLy uuc unaUge of Variable 1i = l/^ 2 

to the integral there. 

Similarly, the following variant of Theorem holds. Suppose that p(x,x) > 7 for all x G V. 
If $(r) > 3> c (r) for all r > 0, where u 1— > u^ 2 (u~ 2 ) is a convex function of u £ [0, 00), then 



holds for all 



. (1-7) 2 f 1 ^ 2du 

V Jir(x)ATr(y) U$ 2 c (u) 



5. Let £ denote the support of the evolving set process. Theorem |I] can be improved by using 
ips( r ) = inf{ , i/ ; (5') : ir(S) < r, S € £}, instead of ift(r). For random walk on the n dimensional 
hypercube, £ consists of Hamming balls, ipe(r) > ^log(l/r) and this gives an upper bound 
of O(nlogn) for the uniform mixing time r(l/4). 

More generally, for any Markov chain {X n } on a poset with a monotone time-reversal, if Xq 
is a maximal (or minimal) state, then £ consists of increasing (respectively, decreasing) sets. 

Acknowledgments. We are grateful to D. Aldous, L. Lovasz , R. Lyons, R. Montenegro, E. 
Mossel and A. Sinclair for useful discussions and comments. 



19 



References 

[1] Alon, N. (1986). Eigenvalues and expanders. Combinatorica 6, 83-96. 

[2] Alon, N. and Milman, V. D. (1985). Ai, Isoperimetric inequalities for graphs and superconcen- 
trators, J. Combinatorial Theory Ser. B 38, 73-88. 

[3] Benjamini, I. and Mossel, E. (2003). On the mixing time of a simple random walk on the super 
critical percolation cluster. Probab. Th. Rel. Fields 125, 408-420. 

[4] Chung, F. R. K. (1996) Laplacians of graphs and Cheeger's inequalities. In Combinatorics, 
Paul Erdos is eighty, Vol. 2 , 157-172, J. Bolyai Soc. Math. Stud., Budapest. 

[5] Chung, F. R. K. and Yau, S. T. (1995) Eigenvalues of graphs and Sobolev inequalities, Com- 
binatorics, Probability and Computing 4, 11-26. 

[6] Coulhon, T. (1996). Ultracontractivity and Nash type inequalities. J. Fund. Anal. 141, 510- 
539. 

[7] Coulhon, T., Grigoryan, A. and Pittet, C. (2001). A geometric approach to on-diagonal heat 
kernel lower bounds on groups. Ann. Inst. Fourier (Grenoble) 51, 1763-1827. 

[8] Diaconis, P. and Fill, J. A. (1990) Strong stationary times via a new form of duality. Ann. 
Probab. 18, 1483-1522. 

[9] Diaconis, P. and Saloff-Coste, L. (1996). Nash inequalities for finite Markov chains. J. Theoret. 
Probab. 9, 459-510. 

[10] Fill, J. A. (1991). Eigenvalue bounds on convergence to stationarity for nonreversible Markov 
chains, with an application to the exclusion process. Ann. Appl. Probab. 1, 62-87. 

[11] Haggstrom, O. and Jonasson, J. (1997). Rates of convergence for lamplighter processes. 
Stochastic Process. Appl. 67, 227-249. 

[12] Houdre, C. and Tetali, P. (2004). Isoperimetric Invariants for Product Markov Chains and 
Graph Products. Combinatorica 24, 359-388. 

[13] Jerrum, M. R. and Sinclair, A. J. (1989). Approximating the permanent. SIAM Journal on 
Computing 18, 1149-1178. 

[14] Kannan, R. (2002). Rapid Mixing in Markov Chains Proceedings of International Congress of 
Math. 2002, Vol. Ill, 673-683. 

[15] Lawler, G. and Sokal, A. (1988). Bounds on the I? spectrum for Markov chains and Markov 
processes: a generalization of Cheeger's inequality. Trans. Amer. Math. Soc. 309, 557-580. 

[16] Lovasz, L. and R. Kannan, R. (1999). Faster mixing via average conductance Proceedings of 
the 27th Annual ACM Symposium on theory of computing. 

[17] Mathieu, P. and Remy, E. (2004). Isoperimetry and heat kernel decay on percolation clusters. 
Ann. Probab. 32, 100-128. 



20 



[18] Mihail, M. (1989). Conductance and convergence of Markov chains - A combinatorial treatment 
of expanders. Proceedings of the 30th Annual Conference on Foundations of Computer Science, 
526-531. 

[19] Montenegro, R. and Son, J.-B. (2001) Edge Isoperimetry and Rapid Mixing on Matroids and 
Geometric Markov Chains, Proceedings of the 33rd Annual ACM Symposium on theory of 
computing. 

[20] Morris, B. (2002). A new, probabilistic approach to heat kernel bounds. Lecture at Sectional 
AMS meeting, Atlanta, GA, March 2002. 

[21] Pittet, C. and Saloff-Coste, L. (2002) A survey on the relationships between volume growth, 
isoperimetry, and the behavior of simple random walk on Cayley graphs, with examples. Un- 
published manuscript, available at http://www.math.cornell.edu/~lsc/lau.html 

[22] Saloff-Coste, L. (1997). Lectures on finite Markov chains. Lecture Notes in Math. 1665, 
Springer, Berlin, 301-413. 

[23] Sinclair, A. (1993). Algorithms for Random Generation and Counting: A Markov Chain Ap- 
proach, Birkhauser, Boston. 

[24] Varopoulos, N. Th. (1985) Isoperimetric inequalities and Markov chains. J. Fund. Anal. 63, 
215-239. 

[25] Woess, W. (2000). Random walks on infinite graphs and groups. Cambridge Tracts in Mathe- 
matics 138, Cambridge University Press. 



21 



